# Lightweight large model
A.X 4.0 Light Gguf
Apache-2.0
A.X 4.0 Light is a lightweight large language model developed by SKT AI Model Lab, built on Qwen2.5 and optimized for Korean understanding and enterprise deployment.
Large Language Model
Transformers Supports Multiple Languages

A
mykor
535
2
Molmo 7B D Bnb 4bit
Apache-2.0
Molmo-7B-D is a large language model quantized with BnB 4-bit. The model size is reduced from 30GB to 7GB, and the video memory requirement is reduced to about 12GB.
Large Language Model
Transformers

M
cyan2k
1,994
17
Phi 3 Mini 4k Instruct Q4
Phi-3 4k Instruct is a lightweight yet powerful language model, processed with 4-bit quantization to reduce resource requirements.
Large Language Model
Transformers

P
bongodongo
39
1
Meta Llama 3 8B Instruct Q4 K M GGUF
Other
The GGUF quantized version of the Llama 3 8B instruction model, suitable for local inference and supporting efficient deployment
Large Language Model English
M
NoelJacob
1,131
1
Minicpm 2B 128k
MiniCPM is an edge-side large language model jointly developed by FaceWall Intelligence and Tsinghua University's Natural Language Processing Lab, with only 2.4 billion non-word embedding parameters (2.4B) and supports a 128k context window.
Large Language Model
Transformers Supports Multiple Languages

M
openbmb
145
42
Mobilevlm 3B
Apache-2.0
MobileVLM is a fast and powerful multi-modal vision-language model designed specifically for mobile devices, supporting efficient cross-modal interaction.
Text-to-Image
Transformers

M
mtgv
346
13
Firefly Bloom 1b4
Open-source Chinese conversational large language model optimized with instruction fine-tuning, specializing in Chinese cultural tasks, with 1.4B/2.6B parameters
Large Language Model
Transformers

F
YeungNLP
55
23
Featured Recommended AI Models